(19) Europäisches Patentamt
European Patent Office
Office européen des brevets

(11) EP 1 156 107 A1

(12) EUROPEAN PATENT APPLICATION
published in accordance with Art. 158(3) EPC

(43) Date of publication:
21.11.2001 Bulletin 2001/47

(21) Application number: 00904009.8 

(22) Date of filing: 17.02.2000 



(51) Int. Cl.7: C12N 15/10, C12Q 1/68

(86) International application number:
PCT/JP00/00902

(87) International publication number: 

WO 00/49143 (24.08.2000 Gazette 2000/34) 



(84) Designated Contracting States: 

AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU
MC NL PT SE 

(30) Priority: 17.02.1999 JP 3853899 

(71) Applicant: Ajinomoto Co., Inc. 
Tokyo 104-0031 (JP) 

(72) Inventors: 

• MAEKAWA, Takami, Ajinomoto Co., Inc.,
Cent. Res. Lab.
Kawasaki-shi, Kanagawa 210-0801 (JP)
• MITSUI, Akira, Ajinomoto Co., Inc.,
Pharma. Res. Lab.
Kawasaki-shi, Kanagawa 210-0801 (JP)
• DATE, Masayo, Ajinomoto Co., Inc., Cent. Res. Lab.
Kawasaki-shi, Kanagawa 210-0801 (JP)
• FUKUDA, Hisao, Ajinomoto Co., Inc.,
Cent. Res. Lab.
Kawasaki-shi, Kanagawa 210-0801 (JP)
• TAKAHARA, Yoshiyuki, Ajinomoto Co., Inc.,
Cent. Res.
Kawasaki-shi, Kanagawa 210-0801 (JP)

(74) Representative: HOFFMANN - EITLE
Patent- und Rechtsanwälte
Arabellastrasse 4
81925 München (DE)



(54) METHOD FOR ANALYZING GENE EXPRESSION FREQUENCY 



(57) In gene expression frequency analysis, a vector primer to which each cDNA is ligated is formed by
synthesizing the cDNA from each mRNA using a vector primer having a poly(T) sequence; each cDNA
sequence is converted to a tag on the vector primer; a concatemer is formed by ligating the obtained tags
via a sequence that enables recognition of ends of the tags; and the nucleotide sequence of the
concatemer is analyzed.
Description

Technical Field

[0001] The present invention relates to a method for analyzing expression frequencies of genes. More precisely, the
present invention relates to a method that enables analysis of types and amounts of mRNA expressed from all of the genes
coding for proteins in a cell, even with a small amount of biosample, in order to elucidate dynamic changes of gene
expression.

Background Art

[0002] The total number of genes on the genome coding for proteins is expected to be about 100,000 for human.
As for yeast, of which the total genomic structure has already been elucidated, the number of genes coding for proteins
is estimated to be about 5000.

[0003] In recent years, public gene databanks have been established mainly in Europe, the United States and Japan.
An enormous amount of gene information has been registered at such databanks from all over the world, and further
information is newly coming together into the databanks every day. The human genome project is currently being
pursued on a worldwide scale, aiming at elucidating all the genes of the human genome by the year 2005, and the gene
information obtained in that project is also being registered at the databanks. By querying such databanks about a
certain gene sequence, one can learn whether any gene having the same sequence as, or a sequence analogous to, that gene
sequence has already been registered and, if registered, obtain information concerning the sequence, including the
designation and function of the gene, related references and so forth. Such a search is called a homology search. There are
several kinds of software for performing homology searches. However, when a large number of samples must be searched,
BLAST, of which the searching time is short, is usually used.
[0004] Usually, not all of the genes contained in a cell are transcribed into mRNA so as to produce proteins
from the mRNA, and it is estimated for human that about 15,000 genes are expressed in one cell. Thus, in a cell, many
kinds of genomic genes are expressed, and a corresponding number of types of messenger RNA (henceforth referred
to as "mRNA") are produced. However, types and amounts of expressed genes (also referred to as "genetic expression
frequency information" hereinafter) may vary depending on the types and conditions of cells. For example, when a
blood stem cell differentiates into a lymphocyte precursor cell, pre-B cell, B cell, and then activated B cell, each cell
shows entirely different gene expression, although there are also genes that are commonly expressed in them.
[0005] Measurement of such genetic expression frequency information as described above is called genetic expression
profile analysis. Substances responsible for cellular life activities are mainly proteins, and it is important to analyze
types and amounts of proteins translated from mRNAs as the genetic expression analysis. However, it is technically
difficult under the current situation to obtain profiles for the total proteins. On the other hand, measurement of total types
of mRNA has already become possible.

[0006] The method reported for the first time as the genetic expression profile analysis method is the Body Map
method (Gene, 174, 151-158 (1996)). The outline of the Body Map method is as follows. A poly(T) sequence on a
vector is annealed to a poly(A) tail at the 3' end of each mRNA, and a cDNA is synthesized by using the vector poly(T)
sequence as a primer. Further, the cDNA is digested with a restriction enzyme MboI. Since one MboI site exists
in every 300 base pairs of cDNA on average, the cDNA on the vector is digested into a length of 300 base pairs on
average. At this time, a cDNA fragment nearest the poly(A) tail remains ligated to the vector. Each vector having this
cDNA fragment is cyclized and introduced into Escherichia coli to prepare a cDNA library. About 1000 clones are
arbitrarily selected from the library, and the nucleotide sequence for 300 base pairs on average of each clone is
determined. The clones are divided into groups of clones having the same sequence, and the type and occurring frequency
of each sequence are calculated to obtain a genetic expression profile. Homology search (BLAST search) is performed
in a databank for each cDNA sequence, and clones containing genes having the same sequence as known genes are
given the names of the genes. When the sequence is not registered at the databank, it is considered that no gene
corresponding to the sequence exists.
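
The grouping and counting described for the Body Map method can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `expression_profile` and the short toy sequences are hypothetical, and real clone reads would be about 300 base pairs long and would need tolerance for sequencing errors.

```python
from collections import Counter

def expression_profile(clone_sequences):
    """Group identical clone sequences and return (sequence, count, fraction) rows."""
    counts = Counter(seq.upper() for seq in clone_sequences)
    total = sum(counts.values())
    # most_common() sorts groups from the most to the least frequent sequence
    return [(seq, n, n / total) for seq, n in counts.most_common()]

# Toy "library": hypothetical short strings standing in for ~300 bp clone reads
clones = ["ACGTTGCA", "ACGTTGCA", "GGCATTAC", "ACGTTGCA", "TTAGGCCA"]
for seq, n, frac in expression_profile(clones):
    print(seq, n, f"{frac:.2f}")
```

Each row pairs a sequence type with its occurring frequency, which is the profile the paragraph describes; naming the groups by homology search is a separate step not shown here.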

[0007] In order to perform homology search by the BLAST search, information for at least 11 base pairs is required.
The types of sequences consisting of 10 nucleotides are about 1,000,000, and this number is far beyond the number
of gene types of which existence is expected in human, i.e., 100,000. That is, if there is information for 11 base pairs,
a gene having the sequence can be identified and thus the genetic expression profile analysis is possible. Therefore,
if, aiming at increasing the efficiency of the genetic expression profile analysis by Body Map, which requires much
sequencing, cDNA fragments of about 300 base pairs used in Body Map are further made into short fragments of 11
base pairs or more (called "tags"), many of these fragments are ligated and inserted into a vector to prepare a library
of ligated tags, about 1000 clones are arbitrarily selected as in Body Map, and DNA sequences of the ligated tags are
determined, it is expected that more genetic expression information can be obtained with the same labor compared
[...]

[0008] [...] of which [...] end is bonded with biotin as a primer, the
cDNA is digested with a restriction enzyme such as [...] (called an "anchoring enzyme") as in Body Map, cDNA
fragments containing the 3' end to which biotin is bonded are adsorbed on avidin beads, the beads are divided into
two portions, and two kinds of linkers (A or B) are each ligated to the cDNA fragments (about 13 bp)
on either of the two portions of the beads. Each linker contains a site for a class II restriction enzyme such as BsmFI
(called a "tagging enzyme"). Each cDNA fragment is excised from the beads with the tagging enzyme, the excised end
is blunt-ended, and the tags ligated to the linker A and the linker B are connected. The product of the connection is
called a "ditag". The ditag is amplified by PCR using primers that recognize the linker A and the linker B. A large number
of amplified ditags are ligated, inserted into a vector, and sequenced. About 50 tag sequences can be obtained by
one sequencing. By calculation based on this tag sequence information, genetic expression frequencies are pro[...]
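
The tag-length argument of paragraph [0007] rests on simple counting: with four possible bases per position there are 4^n distinct sequences of length n. A minimal sketch of that arithmetic follows; the function name and the chosen lengths are illustrative assumptions, and the gene count is the figure quoted in the text.

```python
def distinct_sequences(length):
    """Number of distinct DNA sequences of the given length (4 bases per position)."""
    return 4 ** length

EXPECTED_HUMAN_GENES = 100_000  # figure quoted in the text

for n in (8, 9, 10, 11, 13):
    space = distinct_sequences(n)
    # ratio > 1 means there are more possible sequences than expected genes
    print(n, space, space / EXPECTED_HUMAN_GENES)
```

Ten nucleotides give 1,048,576 possibilities, which matches the "about 1,000,000" stated above, while eight nucleotides would give fewer possibilities than the expected number of genes.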
[0009] Further, as other methods for analyzing expression frequencies of genes, there are the gene chip method
and the gene microarray method. In both of the methods, gene fragments adhered in an array to a suitable
plate (usually slide glass) at an extremely high density (about 10[...] fragments [...] or more) are used. The gene fragments on this
chip are hybridized with fluorescence-labeled mRNAs to determine types and amounts of mRNAs.
[0010] As described above, several methods have been developed for analyzing expression frequencies of genes,
and fair results have been obtained. Currently, the SAGE method is the most effective means for measuring expression
frequencies of the total genes of all eukaryotic organisms. However, when this method was actually practiced, it
encountered many problems, and the SAGE method could not be reproduced in most research facilities. That is, the
techniques required for the SAGE method are complicated and they can be performed only by specially trained persons.
Further, about 1 μg of mRNA is required for the measurement, and therefore it is substantially impossible to perform
the measurement with a sample that can be obtained only in a small amount, for example, a clinical biopsy material, or to
measure difference of genetic expression in micro tissue portions. Furthermore, the method theoretically causes
considerable measurement errors.
[0011] In the SAGE method, it is extremely important to accurately determine the sequence of the tag. This is because
the tag is short (about 13 bp) and therefore, if even only one of the nucleotides is erroneously determined, it may be
determined to be a different sequence even though it is the same sequence, or a different sequence may be determined
to be the same sequence. However, such an erroneous determination is likely to occur in the SAGE method. This is
because, in the SAGE method, two tags are connected to form a ditag and the border between the tags becomes
indefinite. The tag is a short gene fragment excised by a restriction enzyme such as BsmFI, [...]
sites of these enzymes are not always stable, and the lengths of the excised tags are diversified. Thus, if the tags are
connected to form a ditag in the state that tags of different lengths are intermingled, it becomes indefinite from which
tag the nucleotides at the ligation site of the tags have been derived. As a result, it becomes impossible to obtain a correct
sequence of the tag. Thus, the SAGE method suffers from a theoretically inevitable drawback. Furthermore, the SAGE
method involves an operation of collecting DNA using avidin and biotin beads. However, it is in fact extremely difficult
to collect DNA by using avidin and biotin beads without inviting contamination, and it is extremely difficult to obtain
correct data by operating according to the protocol as it is. Moreover, the SAGE method requires a large amount of
mRNA in order to obtain data. Therefore, in a case of a sample of which amount is limited, for example, clinical samples,
a sufficient amount of mRNA cannot be obtained and thus the SAGE method cannot be performed.
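
The ditag border problem described above can be made concrete with a small sketch. It assumes, purely for illustration, that tag lengths vary between 12 and 14 nucleotides and that both tags are written in the same orientation; the function name and the toy sequence are hypothetical.

```python
def possible_splits(ditag, min_len=12, max_len=14):
    """Enumerate every way to cut a ditag into two tags whose lengths lie in the allowed range."""
    splits = []
    for i in range(min_len, max_len + 1):
        left, right = ditag[:i], ditag[i:]
        if min_len <= len(right) <= max_len:
            splits.append((left, right))
    return splits

# A 26-nucleotide toy ditag (assumed sequence, not taken from the text)
ditag = "ACGTACGTACGTA" + "TTGCCAGTTCAGG"
for left, right in possible_splits(ditag):
    print(left, right)
```

For this 26-nucleotide ditag the sketch lists three equally admissible borders (12+14, 13+13 and 14+12), so the nucleotides next to the junction cannot be assigned to one tag with certainty.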
[0012] Furthermore, in the gene chip method and the gene microarray method, the measurement can be performed 
only for a gene of which structure is known, unlike the Body Map method or the SAGE method. Therefore, under the 
current situation, expression frequencies of total genes of all organisms cannot be measured. 

[0013] Currently, the SAGE method is the most effective means for measuring expression frequencies of the total
genes of all eukaryotic organisms. However, when this method was actually practiced, it encountered many problems,
and the SAGE method could not be reproduced in most research facilities. The drawbacks of the SAGE method are
as follows: (1) the techniques required for the method are complicated and they can be performed only by specially
trained persons; (2) about 1 μg of mRNA is required for the measurement, and therefore it is substantially impossible
to perform the measurement with a sample that can be obtained only in a small amount, for example, a clinical biopsy
material, and it is similarly impossible to measure difference of genetic expression in micro tissue portions; and (3) the
method theoretically causes considerable measurement errors because a ditag is measured.
Disclosure of the Invention

[0014] The present invention was accomplished in view of the aforementioned current situation, and an object thereof
is to provide a method that can be easily performed by ordinary researchers and enables accurate genetic expression
frequency analysis with a micro amount of a specimen.

[0015] The inventors of the present invention assiduously studied in order to achieve the aforementioned object. As
a result, they found that expression frequencies of genes could be efficiently analyzed with high precision by
synthesizing each cDNA from each mRNA using a vector primer having a poly(T) sequence, converting each cDNA sequence
to a tag on the vector, forming a concatemer by ligating the obtained tags via a sequence that enables recognition of
ends of the tags, and analyzing a nucleotide sequence of the concatemer, and thus accomplished the method of the
present invention designated as "MAGE (Micro-analysis of Gene Expression)".
[0016] That is, the present invention provides the following.

(1) A method for analyzing expression frequencies of genes, which comprises the following steps: 

(a) a step of forming a vector primer to which each cDNA is ligated, by annealing the vector primer with each
mRNA derived from a cell of which expression frequencies of genes are to be analyzed, and synthesizing the
cDNA, said vector primer comprising a linear plasmid vector having a single-stranded poly(T) sequence at
one 3' end, a recognition sequence for a first restriction enzyme in an inner position from the poly(T) sequence,
a recognition sequence for a second restriction enzyme near the other end, and a recognition sequence for
a type IIS restriction enzyme in an inner position from the recognition sequence for the second restriction
enzyme,

(b) a step of digesting the vector primer to which the cDNA is ligated, with the second restriction enzyme and
a third restriction enzyme that does not digest the vector primer and forms a digested end of the same shape
as a digested end obtained with the second restriction enzyme, to excise an upstream region of the cDNA,
and cyclizing the vector primer,

(c) a step of digesting the cyclized vector primer with the first restriction enzyme and the type IIS restriction
enzyme to excise a downstream region of the cDNA so that a tag consisting of a part of the cDNA is left, and
cyclizing the vector primer again,

(d) a step of performing polymerase chain reaction (PCR) by using the vector primer as a template and
oligonucleotides having nucleotide sequences corresponding to respective flanking regions on both sides of the
tag contained in the vector primer as primers to amplify the tag,

(e) a step of ligating the amplification products to form a concatemer of the tags, and 

(f) a step of determining the nucleotide sequence of the concatemer and investigating types and frequencies
of tags occurring in the nucleotide sequence.

(2) The method according to (1), wherein the ligation reaction in the step (e) is performed in the presence of an
adaptor having one end of the same shape as an end of the tag to arrange the adaptor at each end of the
concatemer, and the concatemer is amplified by performing PCR using an oligonucleotide having a sequence
corresponding to the sequence of the adaptor as a primer.

(3) The method according to (1) or (2), wherein, after the step (e), the concatemer is cloned in a cloning vector for
nucleotide sequencing, and then the nucleotide sequence of the concatemer is determined.

(4) The method according to any one of (1) to (3), wherein the recognition sequence for the third restriction enzyme
consists of four nucleotides.

(5) The method according to any one of (1) to (4), wherein the vector primer has a recognition sequence for a
fourth restriction enzyme of which digestion point is in the same position as or an inner position from the digestion
point of the recognition sequence for the second restriction enzyme, which is not excised from the vector primer
by the digestion with the type IIS restriction enzyme;

the primer for the downstream side of the tag among the primers used in the step (d) has a recognition sequence
for a fifth restriction enzyme that forms an end of the same shape as the end digested with the fourth restriction
enzyme; and

the concatemer is formed after the amplified primers are digested with the fourth restriction enzyme and the
fifth restriction enzyme.






(6) The method according to (5), wherein the vector primer has a nucleotide sequence different from the recognition
sequence for the fifth restriction enzyme by one nucleotide in an inner position from the recognition sequence for
the first restriction enzyme, and the nucleotide sequence different by one nucleotide is converted to the recognition
sequence for the fifth restriction enzyme by PCR using the primer for the downstream side of the tag.

(7) The method according to (6), wherein the third, fourth and fifth restriction enzymes are identical to one another.

(8) The method according to any one of (1) to (7), wherein the vector primer is formed by ligating a linear plasmid
and a partially double-stranded DNA having an end of the same shape as one end of the linear plasmid and a
single-stranded poly(T) sequence.
Brief Explanation of the Drawings 

[0017] Fig. 1 shows an exemplary structure of plasmid DNA for producing a vector primer (MAGE/pUC19).

[0018] Fig. 2 shows an exemplary structure of a vector primer and a construction process therefor.

[0019] Fig. 3 schematically shows the steps (a) and (b) of the method of the present invention.

[0020] Fig. 4 schematically shows the steps (c) and (d) of the method of the present invention.

[0021] Fig. 5 schematically shows the step (e) of the method of the present invention and a step of inserting an
amplification product obtained in the step (e) into a cloning vector for sequencing.

Best Mode for Carrying out the Invention 

[0022] The method of the present invention comprises the aforementioned steps (a) to (f). Hereafter, each step will
be explained.

<1> Step (a) 

[0023] In the first step, a vector primer comprising a linear plasmid vector having a single-stranded poly(T) sequence
at one 3' end, a recognition sequence for a first restriction enzyme in an inner position from the poly(T) sequence, a
recognition sequence for a second restriction enzyme near the other end, and a recognition sequence for a type IIS
restriction enzyme in an inner position from the recognition sequence for the second restriction enzyme is used (also
referred to as a "vector primer for reverse transcription" hereinafter). The type IIS restriction enzyme means a
restriction enzyme that cleaves a specific position remote from a sequence recognized by the restriction enzyme.
[0024] An exemplary structure of the vector primer is shown in Fig. 2, D. This vector primer can be prepared by, for
example, ligating a linear plasmid obtained by digesting a plasmid having a multicloning site at two sites in the
multicloning site, and a partially double-stranded DNA having an end of the same shape as one end of the linear plasmid
and a single-stranded poly(T) sequence. The structure of MAGE/pUC19 is shown in Fig. 1 as an example of the plasmid
having a multicloning site. This plasmid comprises pUC19, which is a known cloning vector, as a basic structure, and a
sequence containing various restriction enzyme recognition sites (referred to as "Z fragment", sequence is shown in
SEQ ID NO: 1) is inserted between the EcoRI and HindIII restriction enzyme sites of the multicloning site thereof.
[0025] In the following specific example, explanation will be made for an example that utilizes MAGE/pUC19 prepared
by using an Escherichia coli host having a Dam methylation system, for example, the JM109 strain etc. However, this is not
essential for the present invention. Further, the vector used as the basic structure is not limited to pUC19, and various
vectors such as pUC18, pBR322, pHSG299, pHSG298, pHSG399, pHSG398, RSF1010, pMW119, pMW118,
pMW219 and pMW218 can be used. In addition, the host of the vector is not particularly limited, so long as it is one
for which ordinary genetic recombination techniques such as transformation and recovery of vectors from the host can
be used. However, Escherichia coli is usually used.

[0026] An example utilizing MAGE/pUC19 as the plasmid having a multicloning site will be explained hereafter.
However, the present invention is not limited to this example.
[0027] First, the MAGE/pUC19 vector (Fig. 2, A) is digested with restriction enzymes BstXI and PstI, and the excised
small fragment is removed (Fig. 2, B). Then, a partially double-stranded DNA having a single-stranded poly(T) sequence
(poly(T) adaptor shown in Fig. 2, C) is ligated to the BstXI-digested end. This provides a vector primer (or a vector
primer for reverse transcription) in which the single-stranded poly(T) protrudes from one end of the MAGE/pUC19
vector (Fig. 2, D). In this vector primer, the first restriction enzyme is PmeI, the second restriction enzyme is BglII, and
the type IIS restriction enzyme is BsgI.

[0028] The recognition sequence for the second restriction enzyme BglII contains an overlapping MboI recognition
sequence, and therefore if the BglII end is ligated to the MboI end, it can be digested with MboI. Further, in this example,
the vector primer is designed to contain a nucleotide sequence (ΔMboI of Fig. 3, E) that is different from the fourth
restriction enzyme (MboI) recognition sequence by one nucleotide in a further inner position from the first restriction
enzyme (PmeI) recognition sequence. This ΔMboI and the first restriction enzyme recognition sequence of the vector
primer are derived from the poly(T) adaptor.
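
The relationship between the two recognition sequences can be checked directly. The sketch below uses the standard recognition sequences AGATCT (BglII) and GATC (MboI); the helper names are hypothetical, and the text does not state which one-nucleotide variant of the MboI site is actually placed in the vector primer, so the code only enumerates the candidates.

```python
BGLII_SITE = "AGATCT"  # BglII recognition sequence
MBOI_SITE = "GATC"     # MboI recognition sequence

def hamming(a, b):
    """Number of mismatching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))

def one_mismatch_variants(site):
    """All sequences that differ from `site` at exactly one position."""
    return sorted(
        site[:i] + base + site[i + 1:]
        for i in range(len(site))
        for base in "ACGT"
        if base != site[i]
    )

print(MBOI_SITE in BGLII_SITE)                # the MboI site lies inside the BglII site
print(len(one_mismatch_variants(MBOI_SITE)))  # number of candidate one-nucleotide variants
```

Because GATC is contained in AGATCT, a junction formed from a BglII end and an MboI end still carries GATC, which is consistent with the statement that it can be digested with MboI.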

[0029] The first restriction enzyme and the second restriction enzyme are not particularly limited, so long as they 
digest the vector primer at one position. Further, the type IIS restriction enzyme and the position of its recognition
sequence are not particularly restricted, so long as it does not excise the fourth restriction enzyme site from the vector
primer, and it digests the upstream region of the cDNA so that a part of the cDNA is left on the vector primer. Specifically,
it may be, for example, BsmFI etc., in addition to the aforementioned BsgI.

[0030] Further, the length of the poly(T) sequence may be a length that enables annealing with the poly(A) sequence 
of mRNA, and it is usually about 10-50 nucleotides. 

[0031] Such a vector primer as mentioned above and mRNA derived from a cell for which expression frequencies
of genes are to be analyzed are annealed. When a reverse transcription reaction is performed in that state, the poly(T)
serves as a primer and cDNA synthesis is initiated (Fig. 3, E). Then, the second strand can be synthesized by using
the synthesized first strand of cDNA as a template to synthesize a double-stranded cDNA (Fig. 3, F). While a large
number of cDNA-ligated vector primer molecules (cDNA-MAGE/pUC19) can be produced from a large number of
mRNA molecules, Fig. 3, F represents one typical example thereof.

[0032] The mRNA is extracted from a cell for which expression frequencies of genes are to be analyzed. Any cell
can be used as the cell for which expression frequencies are to be analyzed without any particular limitation so long
as it is a cell in which the 3' end of mRNA has the poly(A) structure, and it may be a tissue cell of an animal or a plant, or a
cell of a microorganism such as yeast. Further, mRNA of a prokaryote does not have the poly(A) structure at the 3'
end and therefore it cannot be annealed as it is to the poly(T) of the vector primer. However, by enzymatically adding
a poly(A) structure to the mRNA, the method of the present invention can be performed in a manner similar to that for
mRNA of a eukaryotic organism.

[0033] Operations such as preparation of mRNA, cDNA synthesis, synthesis of oligonucleotides, reaction with a
restriction enzyme, ligation reaction, polymerase chain reaction (PCR), and transformation can be performed in the same
manner as the methods used for usual cDNA cloning (see, for example, Sambrook, J., Fritsch, E.F.,
and Maniatis, T., "Molecular Cloning: A Laboratory Manual, Second Edition", Cold Spring Harbor Laboratory Press
(1989) etc.).

<2> Step (b) 



[0034] Then, the vector primer to which cDNA is ligated, obtained as described above, is digested with the second
restriction enzyme and a third restriction enzyme that does not digest the vector primer and forms a digested end of
the same shape as a digested end obtained with the second restriction enzyme, to excise an upstream region of the
cDNA, and the vector primer is cyclized. The cyclized vector primer can be amplified by introducing it into a suitable
host, culturing the obtained transformant and collecting a plasmid. As also applies to the cyclized vector primers
used in the subsequent steps, when the restriction enzyme site for cleaving the obtained vector primer cannot be
digested due to Dam methylation, a host that does not have the modification system is used.

[0035] As the third restriction enzyme, an enzyme recognizing four nucleotides is preferred. If an enzyme recognizing
six nucleotides is used, such a restriction enzyme site may not exist in the cDNA sequence. Further, if the sequence
of cDNA left on the vector primer is long, the tag obtained by the subsequent procedures will become far from the poly(A)
sequence of mRNA. In such a case, since the expression information of genes in a database (EST: expressed
sequence tag) is usually for parts of the 3' end side of mRNA, the target sequence may not be retrieved even if tag
sequences are searched. Examples of restriction enzymes suitable for the present invention, which recognize four
nucleotides, include MboI, TaiI and so forth.
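
The preference for a four-nucleotide recognition sequence can be illustrated with a rough probability estimate. The sketch assumes a random sequence with equal base frequencies and treats site positions as independent, which real cDNA does not strictly satisfy; the function names and the 1000-nucleotide cDNA length are illustrative assumptions.

```python
def expected_site_spacing(site_length):
    """Mean distance between occurrences of a fixed site in random, equiprobable sequence."""
    return 4 ** site_length

def prob_at_least_one_site(cdna_length, site_length):
    """Approximate probability that a random cDNA of this length contains the site at least once.

    Window positions are treated as independent, which is an approximation.
    """
    windows = max(cdna_length - site_length + 1, 0)
    return 1 - (1 - 4 ** -site_length) ** windows

for k in (4, 6):
    print(k, expected_site_spacing(k), round(prob_at_least_one_site(1000, k), 3))
```

Under these assumptions a four-nucleotide site occurs about once per 256 nucleotides, in line with the roughly 300 base pairs quoted for MboI in the background section, whereas a six-nucleotide site is expected only once per 4096 nucleotides and is therefore frequently absent from a given cDNA.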

[0036] Hereafter, an example utilizing MboI as the third restriction enzyme will be explained. First, the cDNA-ligated
vector primer (cDNA-MAGE/pUC19) is digested with restriction enzymes BglII and MboI (Fig. 3, G). Since MAGE/pUC19
has been Dam-methylated at this time, it is not digested with MboI, and only the MboI site newly synthesized
in the cDNA by the reverse transcription is digested. Fig. 3, G shows a hypothetical example where the cDNA contains
three MboI sites. As for the cDNA, a downstream region from the poly(A) tail to the first MboI site toward the upstream
region remains ligated to the MAGE/pUC19 vector primer, and the other portion, i.e., the upstream region of the cDNA,
is excised and removed by the MboI digestion. As for MAGE/pUC19, it is digested with BglII at only one position. The
end portions digested with MboI and BglII are in the same shape, and they can be ligated by a ligation reaction.
Therefore, the cDNA-MAGE/pUC19 is cyclized by a self-ligation reaction via these ends (Fig. 3, H).
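
The effect of the MboI digestion on the cDNA can be mimicked on a sequence string. This is a simplified sketch: it looks only at the sense strand, ignores the overhang produced by the enzyme, and uses a made-up cDNA with three GATC sites; the function name is hypothetical.

```python
MBOI_SITE = "GATC"

def three_prime_fragment(cdna):
    """Return the part of the cDNA from its 3'-most MboI site to the 3' end.

    `cdna` is the sense strand written 5'->3', ending with the poly(A) tail.
    Returns None when the cDNA contains no MboI site.
    """
    pos = cdna.upper().rfind(MBOI_SITE)
    if pos == -1:
        return None
    return cdna[pos:]

# Toy cDNA with three MboI sites followed by a short poly(A) tail (assumed sequence)
cdna = "ATG" + "GATC" + "CCTT" + "GATC" + "AAGGCT" + "GATC" + "TTACCGGTTAAGC" + "A" * 10
print(three_prime_fragment(cdna))
```

Only the fragment nearest the poly(A) tail is returned, corresponding to the region that stays ligated to the vector primer; everything upstream of the last GATC is discarded.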

<3> Step (c) 

[0037] The vector primer cyclized in the step (b) is digested with the first restriction enzyme and the type IIS restriction
enzyme to excise the downstream region of the cDNA, so that a tag consisting of a part of the cDNA is left, and the
vector primer is cyclized again.

[0038] Specifically, the cyclized cDNA-MAGE/pUC19 is digested with restriction enzymes BsgI and PmeI (Fig. 4, I).
After digestion with these restriction enzymes, only about 13 nucleotides farthest from the poly(A) tail of the 3' end
portion of the cDNA remain on the vector primer. This sequence of cDNA consisting of about 13 nucleotides is called
a "tag".

[0039] Since the end digested with BsgI protrudes by 2 bp, it is blunt-ended by T4 DNA
polymerase treatment, for example. Further, the other end, digested with PmeI, is a blunt end. Therefore, these ends can
be ligated (Fig. 4, J). Thus, if the vector primer is cyclized by a self-ligation reaction, a vector primer having a structure
comprising MAGE/pUC19 to which a short tag is ligated, in which the part of the cDNA other than the tag is excised, is
obtained (Fig. 4, J and K).
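
The trimming of the retained cDNA fragment down to a tag can be sketched as follows. Several points are assumptions made only for illustration: the tag is taken as exactly 13 nucleotides although the text says "about 13", it is assumed to start immediately after the GATC of the junction, and the function name is hypothetical. The exact cut geometry of the type IIS enzyme is not modelled.

```python
def extract_tag(fragment, site="GATC", tag_length=13):
    """Return the tag: the `tag_length` nucleotides directly 3' of the leading site."""
    if not fragment.startswith(site):
        raise ValueError("fragment must start with the anchoring site")
    tag = fragment[len(site):len(site) + tag_length]
    if len(tag) < tag_length:
        raise ValueError("fragment too short to yield a full tag")
    return tag

# Toy 3'-terminal cDNA fragment beginning at its MboI site (assumed sequence)
fragment = "GATC" + "TTACCGGTTAAGC" + "A" * 10
print(extract_tag(fragment))
```

The nucleotides kept are those farthest from the poly(A) tail within the retained fragment, matching the description of which part of the cDNA remains on the vector primer.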



<4> Step (d) 



[0040] Then, PCR is performed by using the vector primer cyclized in the step (c) as a template, and oligonucleotides
having nucleotide sequences corresponding to respective flanking regions on both sides of the tag in the vector
primer as primers to amplify the tag.

[0041] PCR is performed by using the vector primer containing the tag as a template, and primers
corresponding to the sequences of the vector segments on both sides of the tag (for example, those of SEQ ID NOS:
2 and 3) (Fig. 4, K). As a result, a DNA fragment containing the tag at the center is amplified (in this case, 110 bp
including the sequences of primers and the tag) (Fig. 4, L).






<5> Step (e) 



[0042] The aforementioned PCR amplification products are ligated to form a concatemer of the tags. At this time, if
a recognition sequence for a fifth restriction enzyme that produces an end of the same shape as the end obtained by
digestion with the fourth restriction enzyme is introduced into the primer for the downstream region of the tag among
the primers used in the step (d), and the amplification product is digested with the fourth restriction enzyme and the fifth
restriction enzyme, the concatemer can be formed efficiently. The type of the fifth restriction enzyme is not particularly
limited. However, if the same restriction enzyme as the fourth restriction enzyme is used, the procedure will become
simple. Further, the same restriction enzyme as the third restriction enzyme may be used as the fourth restriction
enzyme. Furthermore, the third, fourth and fifth restriction enzymes may be the same enzyme. However, if the same
enzyme is used as the fifth restriction enzyme and fourth restriction enzyme, the total sequence of the cDNA will be
excised from the vector primer in the step (b). Therefore, in order to prevent this, it is necessary to introduce a nucleotide
sequence different from the recognition sequence for the fifth restriction enzyme by one nucleotide in a
further inner position from the first restriction enzyme recognition sequence of the vector primer, and perform PCR by
using a primer having a recognition sequence for the fifth restriction enzyme for the downstream side of the tag, so that
the nucleotide sequence different by one nucleotide is replaced with the recognition sequence for the fifth restriction
enzyme. Further, when the same enzyme is used as the third restriction enzyme and fourth restriction enzyme, the
recognition site of the vector primer for the fourth restriction enzyme must be made undigestible by methylation.
In this case, before digestion with the fourth restriction enzyme, the vector primer cyclized in the step (c) is introduced
into a host not having the modification system for demodification.

[0043] If the ligation reaction of the tags is performed in the presence of an adaptor having one end of the same
shape as an end of the tag to arrange the adaptor at each end of the concatemer, and PCR is performed by using an
oligonucleotide having a sequence corresponding to the sequence of the adaptor as a primer, the concatemer can be
amplified. If the adaptor is used in a smaller amount than the tag, or if an adaptor having one end of a shape
different from that of the tag is used, the adaptor can be arranged at each end of a concatemer consisting of a large number
of tags. In this way, even a micro amount of concatemer can be cloned, and as a result, analysis becomes possible
with a micro amount of sample mRNA.

[0044] The molar ratio of the tags and the adaptor is usually tag:adaptor = 1:1 to 1:0.01, preferably 1:0.2 to 1:0.05.
If the adaptor is used in this range, a concatemer composed of about 2 to 50 ligated tags may be obtained.
[0045] Insertion of the obtained concatemer into a cloning vector for nucleotide sequencing makes the sequencing

[0046] When MAGE/pUC19 is used, a small fragment containing a tag is obtained by digesting the amplification
product of the step (d) with the restriction enzyme MboI (Fig. 5, M). The tag fragment obtained here is present between
known DNA sequences, i.e., GATC and AAACG, and therefore the portion that is the tag can be recognized unambiguously.
Since MboI sites are exposed at both ends of this tag sequence, the tags can be ligated to each other by a ligation
reaction to form a concatemer (Fig. 5, M).
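
The way the known flanking sequences delimit each tag can be illustrated with a short sketch. It is not part of the disclosed procedure: the function name `extract_tags` is hypothetical, the tag length of 11 nucleotides is an assumption taken from the tags listed in Table 1, and the example concatemer is constructed by hand from two of those tags.

```python
import re

# Assumption: tags are 11 nt long, as in Table 1.
TAG_LENGTH = 11

# One tag unit reads GATC + tag + AAACG on the sequenced strand.
TAG_UNIT = re.compile(r"GATC([ACGT]{%d})AAACG" % TAG_LENGTH)


def extract_tags(concatemer):
    """Return, in order, every tag flanked by GATC (5') and AAACG (3')."""
    return TAG_UNIT.findall(concatemer.upper())


# Hand-built concatemer of two tag units followed by a closing GATC.
example = "GATCTGCATTCCATCAAACG" + "GATCCCTGGTGGAAAAAACG" + "GATC"
print(extract_tags(example))
```

Because every tag sits between the same two known sequences, no tag can run directly into its neighbour, which is what makes the boundary search a simple pattern match.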



<6> Step (f)

[0047] If the nucleotide sequence of the concatemer obtained as described above is determined, since tags derived
from many cDNA molecules are contained in it, the expression frequencies of the genes from which the cDNAs
originated can be analyzed by investigating types and frequencies of the tags occurring in the nucleotide sequence. The
type of each tag can be determined by searching a database containing information on partial sequences of known mRNAs
(EST).
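
As a minimal sketch of this counting step, the following groups identical tags and ranks them by frequency. The function name `rank_tags` and the small dictionary `KNOWN_TAGS` are hypothetical; the dictionary merely stands in for an EST database search and holds two entries copied from Table 1.

```python
from collections import Counter


def rank_tags(tags):
    """Return (tag, count) pairs, most frequent first; ties sorted by tag."""
    counts = Counter(tags)
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical stand-in for a database of partial mRNA sequences (EST).
KNOWN_TAGS = {
    "TGCATTCCATC": "Urinary protein I/II",
    "CCTGGTGGAAA": "Albumin",
}

observed = ["TGCATTCCATC", "CCTGGTGGAAA", "TGCATTCCATC", "ACGTACGTACG"]
for tag, count in rank_tags(observed):
    print(tag, count, KNOWN_TAGS.get(tag, "unassigned"))
```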

[0048] When the concatemer of the tags is amplified, two or more fragments having the same sequence will be
formed. However, if a sequence considered to be the same as one previously sequenced appears in the sequence
analysis after the cloning, it can be excluded from the analysis to eliminate the adverse influence of PCR on the genetic
expression frequency analysis.
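
One simple way to apply this exclusion is sketched below. It assumes that "the same" means an exact match of the full concatemer sequence, which is a simplification; the function name `drop_repeated_clones` is hypothetical.

```python
def drop_repeated_clones(clone_sequences):
    """Keep the first occurrence of each concatemer sequence, preserving order."""
    seen = set()
    unique = []
    for seq in clone_sequences:
        key = seq.upper()  # compare case-insensitively
        if key not in seen:
            seen.add(key)
            unique.append(seq)
    return unique


clones = ["GATCAAAA", "GATCCCCC", "gatcaaaa"]
print(drop_repeated_clones(clones))
```

Tags are then counted only from the clones that remain, so that a concatemer copied many times by PCR contributes its tags once.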

[0049] Further, when the same enzyme is used as the fourth and fifth restriction enzymes, since the direction of the tags
in the concatemer is not fixed, sequences of not only one strand but also the reverse strand are taken into consideration
in the sequence analysis.
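
Reading both strands can be sketched as follows. The example assumes 11-nucleotide tags flanked by GATC and AAACG, as in the MAGE/pUC19 case; the names `reverse_complement` and `extract_tags_both_strands` are hypothetical, and the example read is constructed by hand with one tag unit in each orientation.

```python
import re

# Assumption: each tag unit reads GATC + 11-nt tag + AAACG on its own strand.
TAG_UNIT = re.compile(r"GATC([ACGT]{11})AAACG")
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def extract_tags_both_strands(read):
    """Collect tags found on the read itself and on its reverse strand."""
    forward = TAG_UNIT.findall(read.upper())
    reverse = TAG_UNIT.findall(reverse_complement(read))
    return forward + reverse


# First unit in forward orientation, second unit inserted in reverse.
read = "GATCTGCATTCCATCAAACG" + "GATC" + "CGTTTTTTCCACCAGG" + "GATC"
print(extract_tags_both_strands(read))
```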

[0050] As explained above, in the method of the present invention, a ditag is not produced as in the SAGE method,
but a tag can be prevented from directly connecting with other tags by putting each tag between known DNA fragments.
As a result, the problem that the border between tags becomes indefinite is solved. Further, since a sample can be
amplified by performing PCR several times, the analysis is possible even with a micro amount of mRNA.
Furthermore, the use of the vector primer enables cDNA synthesis in a state in which the cDNA is fused to the vector. Therefore,
the analysis can be performed without using avidin and biotin beads.

Example 

[0051] Thirty-five micrograms (35 µg) of mRNA was extracted from 1 g of C57BL/6 mouse livers by using the Fast Track
2.0 kit produced by Invitrogen.

[0052] After cDNA was synthesized through reverse transcription by using 0.97 µg of the obtained mRNA and
MAGE/pUC19, genetic expression frequency analysis was performed as follows.

[0053] The MAGE/pUC19 vector (Fig. 2, A) was digested with restriction enzymes BstXI and PstI, and the excised
small fragment was removed (Fig. 2, B). Then, a partially double-stranded DNA having a single-stranded poly(T)
sequence (poly(T) adaptor shown in Fig. 2, C) was ligated to the BstXI-digested end (Fig. 2, D). In this vector primer, the
first restriction enzyme was PmeI, the second restriction enzyme was BglII, and the type IIS restriction enzyme was
BsgI (Fig. 3, E).
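
The positions of the restriction sites within the multi-cloning site can be checked with a short sketch. The sequence below is the top strand of the Z fragment as read from Fig. 1 (an assumption: the figure is taken as the reference, and it is one nucleotide longer than the entry for SEQ ID NO: 1 in the sequence listing). The recognition sequences are the standard ones for these enzymes; the function name `find_sites` is hypothetical.

```python
import re

# Top strand of the Z fragment as read from Fig. 1 (assumed reference).
Z_FRAGMENT = "GAATTCGTGCAGATCTCACACTGCAGAGATCCAACAGCATGGAAGCTT"

# Standard recognition sequences; N stands for any nucleotide.
SITES = {
    "EcoRI": "GAATTC",
    "BsgI": "GTGCAG",
    "BglII": "AGATCT",
    "MboI": "GATC",
    "PstI": "CTGCAG",
    "BstXI": "CCANNNNNNTGG",
    "HindIII": "AAGCTT",
}


def find_sites(seq, pattern):
    """Return 0-based start positions of a recognition sequence (overlaps allowed)."""
    regex = re.compile("(?=%s)" % pattern.replace("N", "[ACGT]"))
    return [match.start() for match in regex.finditer(seq.upper())]


for enzyme, pattern in SITES.items():
    print(enzyme, find_sites(Z_FRAGMENT, pattern))
```

In this reading the PstI site lies upstream of the BstXI site, so the short stretch between them corresponds to the small fragment that is removed.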

[0054] Zero point two micrograms (0.2 µg) of the vector primer and 0.97 µg of the mRNA derived from mouse liver
were annealed, and cDNA synthesis was performed. Then a second strand was synthesized by using the synthesized
first cDNA strand as a template (Fig. 3, F). Genetic expression frequency analysis was performed by using 1/40 of the
obtained amount of cDNA (corresponding to 0.025 µg of mRNA) as a material.
[0055] In another experiment, genetic expression frequency analysis could be successfully performed by using
0.05 µg of mRNA extracted from 4.5 mg of a clinical sample as an experimental material to
synthesize cDNA through reverse transcription and following the same procedures as described below.
[0056] Then, the vector primer to which the cDNA was ligated, obtained as described above, was digested with BglII
and MboI, which were the second and third restriction enzymes, respectively (Fig. 3, G). Subsequently, the MboI end
and BglII end of cDNA-MAGE/pUC19 were ligated by a self-ligation reaction to cyclize the vector primer (Fig. 3, H).

[0057] The above cyclized vector primer was digested with BsgI and PmeI, and the digested ends were blunt-ended
by T4 DNA polymerase treatment. Then, the vector primer was cyclized again (Fig. 4, I to K). PCR was performed by
using this cyclized vector primer as a template, oligonucleotides having nucleotide sequences corresponding to the
respective flanking regions on both sides of the tag in the vector primer (SEQ ID NOS: 2 and 3) as primers, and the
enzyme AmpliTaq Gold (PE Biosystems) to amplify the tag (Fig. 4, L). The above PCR was performed by repeating a
reaction cycle of denaturation (95°C, 0.3 minute), annealing and an extension reaction with the polymerase (72°C, 1.5
minutes) for 65 cycles. As a result, a DNA fragment of 110 bp containing the tag at the center was amplified.
[0058] The above PCR amplification product was digested with MboI to excise the tag, and a ligation reaction was
performed with addition of an adaptor obtained by annealing oligonucleotides having the nucleotide sequences shown
in SEQ ID NOS: 4 and 5 such that the ratio of tag:adaptor was 8:1, to form a concatemer of the tags (Fig. 5, M).
This concatemer was amplified by PCR using a primer having the nucleotide sequence shown in SEQ ID NO: 6. For
PCR, a reaction cycle of denaturation (95°C, 0.3 minute), annealing (40°C, 3 minutes) and an extension reaction with
polymerase (72°C, 1 minute) was repeated for 5 cycles, and then a reaction cycle of denaturation (95°C, 0.3 minute),
annealing and an extension reaction (72°C, 1 minute) was repeated for 60 cycles. The amplification product was
digested with the restriction enzyme NotI and inserted into the NotI site of the cloning vector pKF3 for sequencing, and
the nucleotide sequence was determined.

[0059] Based on the obtained sequence data (for 11000 tags), the tags were divided into groups of tags having the
same sequence, and the order of the groups according to the occurring frequency was determined. The tags ranked
first to tenth for the frequency from the highest are shown in Table 1. This result shows that the gene showing the
[...] gene, the urinary protein [...] mRNA sample, the method of the present invention
[...] the use of a vector primer, and enabled genetic expression frequency
analysis with high precision since ambiguity of tag sequences can be eliminated.

Table 1

Tag (SEQ ID NO)      Rank for frequency   Number of tags   Designation of gene
TGCATTCCATC (7)      1                    2196             Urinary protein I/II
CCTGGTGGAAA (8)      2                    1261             Albumin
TGCTCTCCACC (9)      3                    485              Urinary protein III
GGGAAGTACGC (10)     4                    383              Argininosuccinate synthase
ACCTCGGATGA (11)     5                    345              Fibrinogen Aα
TTCCAGGCCCG (12)     6                    333              Apolipoprotein E
ACCAGTGTCGC (13)     7                    310              Mouse EST
TGCATGCCCTG (14)     8                    307              Ferritin light chain
CACTACAGCAC (15)     9                    300              Mouse EST
CTGCCAAGTTC (16)     10                   226              Retinol-binding protein
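
The tag counts in Table 1 can be converted into relative expression frequencies with a short calculation. The sketch below assumes that the total of 11000 tags stated in paragraph [0059] is the denominator; the names `TABLE_1` and `relative_frequency` are hypothetical.

```python
# Assumption: the 11000 tags mentioned in paragraph [0059] form the total.
TOTAL_TAGS = 11000

# (tag, number of tags, designation of gene), copied from Table 1.
TABLE_1 = [
    ("TGCATTCCATC", 2196, "Urinary protein I/II"),
    ("CCTGGTGGAAA", 1261, "Albumin"),
    ("TGCTCTCCACC", 485, "Urinary protein III"),
    ("GGGAAGTACGC", 383, "Argininosuccinate synthase"),
    ("ACCTCGGATGA", 345, "Fibrinogen A alpha"),
    ("TTCCAGGCCCG", 333, "Apolipoprotein E"),
    ("ACCAGTGTCGC", 310, "Mouse EST"),
    ("TGCATGCCCTG", 307, "Ferritin light chain"),
    ("CACTACAGCAC", 300, "Mouse EST"),
    ("CTGCCAAGTTC", 226, "Retinol-binding protein"),
]


def relative_frequency(count, total=TOTAL_TAGS):
    """Return the share of all sequenced tags, in percent."""
    return 100.0 * count / total


for tag, count, gene in TABLE_1:
    print(f"{tag}  {gene}: {relative_frequency(count):.2f}%")
```

On this assumption the ten listed tags together account for 6146 of the 11000 sequenced tags.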



Industrial Applicability

[0060] According to the present invention, expression frequencies of genes can be surely analyzed with good precision [...]

[0061] The method for analyzing genetic expression of the present invention is useful for researches concerning life
science [...] for development of methods for treatment or diagnosis of diseases through analysis
of difference in genetic expression between specific organs or cells of healthy subjects and subjects with disease.
[0062] [...] by analyzing difference in expression of genes between livers of a healthy subject and a subject with hepatitis
by the method of the present invention, a gene of which expression is specifically increased or decreased in hepatitis
can be discovered. By investigating the role in the liver of such a gene, drugs for inhibiting or accelerating the function
of the gene can be developed for the purpose of treatment of hepatitis. Moreover, [...]
nucleotide designed based on the gene structure, a protein produced by expression of the gene and so forth can also
be developed by using the method of the present invention. Furthermore, if a gene of which expression changes in
a disease-specific manner can be found by the method of the present invention, it becomes possible to develop not
only a method of treatment, but also a method of diagnosis of a disease.
[0063] Moreover, not only for use in the medical field, the method can also be a means for identifying useful genes from
all eukaryotic organisms. For example, after yeast strains suitable for the brewing of beer are bred by mutation,
change of genetic expression between a parent strain and a mutant strain can be analyzed by the method of the present
invention, so that a gene of which expression is changed by the mutation can be identified.
[...] manipulating genes advantageous for the brewing of beer, which are obtained as described above, a better brewer's yeast [...]

[0064] Further, by analyzing amino acid-producing bacteria such as Escherichia coli and Corynebacterium bacteria,
for example, more excellent amino acid-producing bacteria can be created.
SEQUENCE LISTING
<110> Ajinomoto Co., Inc.

<120> Method for analyzing expression frequencies of genes 

<130> B-583AYOP962 

<150> JP 11-38538 
<151> 1999-02-17 

<160> 16

<210> 1 
<211> 47 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: multi-cloning site 
<400> 1 

gattcgtgca gatctcacac tgcagagatc caacagcatg gaagctt 

<210> 2 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

acgccagggt tttcccagtc acgacg 

<210> 3 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 3

atgattacgc caagcttcca tgctggctcc gatccgttt

<210> 4 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: adaptor 



<400> 4 

gatcgagatc tgcaaccaga gtcg 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: adaptor 



<400> 5 

cgactctggt tgcagatctc 

<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

aacttcgact gcggccgcag atctcgatc 

<210> 7 
<211> 11
<212> DNA 

<213> Mus musculus 



<400> 7 
tgcattccat c 
<210> 8
<211> 11
<212> DNA

<213> Mus musculus
<400> 8

cctggtggaa a 11

<210> 9 

<211> 11 

<212> DNA 

<213> Mus musculus 

<400> 9

tgctctccac c 11

<210> 10 
<211> 11 
<212> DNA 

<213> Mus musculus 
<400> 10 

gggaagtacg c 11

<210> 11
<211> 11
<212> DNA

<213> Mus musculus
<400> 11

acctcggatg a

<210> 12
<211> 11
<212> DNA

<213> Mus musculus
<400> 12

ttccaggccc g 11

<210> 13
<211> 11
<212> DNA

<213> Mus musculus
<400> 13

accagtgtcg c 11

<210> 14 
<211> 11
<212> DNA 

<213> Mus musculus 
<400> 14 

tgcatgccct g 11

<210> 15 
<211> 11 
<212> DNA 

<213> Mus musculus 

<400> 15 
cactacagca c 

<210> 16
<211> 11
<212> DNA

<213> Mus musculus

<400> 16
ctgccaagtt c



Claims 

1. A method for analyzing expression frequencies of genes, which comprises the following steps:

(a) a step of forming a vector primer to which each cDNA is ligated, by annealing the vector primer with each
mRNA derived from a cell of which expression frequencies of genes are to be analyzed, and synthesizing the
cDNA, said vector primer comprising a linear plasmid vector having a single-stranded poly(T) sequence at
one 3' end, a recognition sequence for a first restriction enzyme in an inner position from the poly(T) sequence,
a recognition sequence for a second restriction enzyme near the other end, and a recognition sequence for
a type IIS restriction enzyme in an inner position from the recognition sequence for the second restriction
enzyme,

(b) a step of digesting the vector primer to which the cDNA is ligated, with the second restriction enzyme and
a third restriction enzyme that does not digest the vector primer and forms a digested end of the same shape
as a digested end obtained with the second restriction enzyme, to excise an upstream region of the cDNA,
and cyclizing the vector primer,

(c) a step of digesting the cyclized vector primer with the first restriction enzyme and the type IIS restriction
enzyme to excise a downstream region of the cDNA so that a tag consisting of a part of the cDNA is left, and
cyclizing the vector primer again,

(d) a step of performing polymerase chain reaction (PCR) by using the vector primer as a template and
oligonucleotides having nucleotide sequences corresponding to respective flanking regions on both sides of the
tag contained in the vector primer as primers to amplify the tag,

(e) a step of ligating the amplification products to form a concatemer of the tags, and

(f) a step of determining the nucleotide sequence of the concatemer and investigating types and frequencies
of tags occurring in the nucleotide sequence.

2. The method according to Claim 1, wherein the ligation reaction in the step (e) is performed in the presence of an
adaptor having one end of the same shape as an end of the tag to arrange the adaptor at each end of the con- 
catemer, and the concatemer is amplified by performing PCR using an oligonucleotide having a sequence corre- 
sponding to the sequence of the adaptor as a primer. 

3. The method according to Claim 1 or 2, wherein, after the step (e), the concatemer is cloned in a cloning vector for
nucleotide sequencing, and then the nucleotide sequence of the concatemer is determined. 

4. The method according to any one of Claims 1-3, wherein the recognition sequence for the third restriction enzyme
consists of four nucleotides. 

5. The method according to any one of Claims 1-4, wherein the vector primer has a recognition sequence for a fourth
restriction enzyme of which digestion point is in the same position as or an inner position from the digestion point
of the recognition sequence for the second restriction enzyme, which is not excised from the vector primer by the
digestion with the type IIS restriction enzyme;

the primer for the downstream side of the tag among the primers used in the step (d) has a recognition sequence
for a fifth restriction enzyme that forms an end of the same shape as the end digested with the fourth restriction
enzyme; and

the concatemer is formed after the amplified primers are digested with the fourth restriction enzyme and the
fifth restriction enzyme.

6. The method according to Claim 5, wherein the vector primer has a nucleotide sequence different from the recog- 
nition sequence for the fifth restriction enzyme by one nucleotide in an inner position from the recognition sequence 
for the first restriction enzyme, and the nucleotide sequence different by one nucleotide is converted to the recog- 
nition sequence for the fifth restriction enzyme by PCR using the primer for the downstream side of the tag. 

7. The method according to Claim 6, wherein the third, fourth and fifth restriction enzymes are identical to one another. 



8. The method according to any one of Claims 1-7, wherein the vector primer is formed by ligating a linear plasmid
obtained by digesting a plasmid having a multicloning site at two sites in the multicloning site, and a partially
double-stranded DNA having an end of the same shape as one end of the linear plasmid and a single-stranded poly(T)
sequence.
Z fragment (SEQ ID NO: 1), with the EcoR I, Bsg I, Mbo I, Pst I, BstX I and Hind III sites indicated

GAATTCGTGCAGATCTCACACTGCAGAGATCCAACAGCATGGAAGCTT
CTTAAGCACGTCTAGAGTGTGACGTCTCTAGGTTGTCGTACCTTCGAA

Fig. 1
Construction of the vector primer for reverse transcription: (A) MAGE/pUC19 carrying the Z fragment; (B) digestion
with Pst I and BstX I and removal of the excised small fragment; (C) poly(T) adaptor and its ligation; (D) vector
primer for reverse transcription.

Fig. 2
cDNA synthesis and first cyclization: (E) mRNA annealed to the vector primer for reverse transcription; (F) first strand
synthesis and second strand synthesis; (G) digestion with Bgl II and Mbo I and removal of the excised region of the
synthesized cDNA; (H) ligation and cyclization, leaving the 3' terminal portion of the synthesized cDNA between the
Mbo I-digested end and the Bgl II-digested end.

Fig. 3
Formation and amplification of the tag: (I to K) Bsg I and Pme I treatment, removal of the excised fragment, blunting
of ends, ligation and cyclization; (L) PCR amplification of the tag.

Primer sequences

ACGCCAGGGTTTTCCCAGTCACGACG (SEQ ID NO: 2)
ATGATTACGCCAAGCTTCCATGCTGGCTCCGATCCGTTT (SEQ ID NO: 3)

PCR amplification product (tag between Mbo I sites)

GATCNNNNNNNNNNNAAACGGATC
CTAGNNNNNNNNNNNTTTGCCTAG

Fig. 4
Formation and cloning of the concatemer: (M) Mbo I-digested tag fragments and their ligation into a concatemer of tags
carrying PCR primer sites at its ends; (N) insertion into a cloning vector for sequencing.

Tag fragment

GATCNNNNNNNNNNNAAACG
    NNNNNNNNNNNTTTGCCTAG

Fig. 5






INTERNATIONAL SEARCH REPORT

International application No.
PCT/JP00/00902

A. CLASSIFICATION OF SUBJECT MATTER

Int. Cl. 7 C12N 15/10, C12Q 1/68

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)
Int. Cl. C12N 15/00-90, C12Q 1/00-70, C12N 9/00-99

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category*
Citation of document, with indication, where appropriate, of the relevant passages
Relevant to claim No.

PA

Giguere S. et al., "Quantitation of equine cytokine mRNA expression by reverse transcription-competitive
polymerase chain reaction", Vet Immunol Immunopathol (Jan. 4 1999), Vol. 67, No. 1, p. 1-15
Relevant to claim No.: 1-8

Czarnota GJ. et al., "High resolution microanalysis and three-dimensional nucleosome structure associated with
transcribing chromatin", Micron (1997 Dec), Vol. 28, No. 6, p. 419-431
Relevant to claim No.: 1-8

Berangere Virlon et al., "Serial microanalysis of renal transcriptomes", Proc. Natl. Acad. Sci. USA (Dec. 21 1999),
Vol. 96, No. 26, p. 15286-15291
Relevant to claim No.: 1-8

WO, 97/10363, A1 (UNIV. JOHNS HOPKINS SCHOOL MEDICINE), 20 March, 1997 (20.03.97)
& EP, 761822, A2 & US, 5695937, A & US, 5866330, A & JP, 10-511002, A
Relevant to claim No.: 1-8

Further documents are listed in the continuation of Box C. See patent family annex.

* Special categories of cited documents:
"A" document defining the general state of the art which is not considered to be of particular relevance
"E" earlier document but published on or after the international filing date
"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of
another citation or other special reason (as specified)
"O" document referring to an oral disclosure, use, exhibition or other means
"P" document published prior to the international filing date but later than the priority date claimed
"T" later document published after the international filing date or priority date and not in conflict with the
application but cited to understand the principle or theory underlying the invention
"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered
to involve an inventive step when the document is taken alone
"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step
when the document is combined with one or more other such documents, such combination being obvious to a
person skilled in the art
"&" document member of the same patent family

Date of the actual completion of the international search
16 May, 2000 (16.05.00)

Date of mailing of the international search report
06.06.00

Name and mailing address of the ISA/
Japanese Patent Office

Facsimile No.

Authorized officer
Telephone No.

Form PCT/ISA/210 (second sheet) (July 1992)




